Keyword Suggestion Using Concept Graph Construction from Wikipedia Rich Documents
نویسندگان
چکیده
Concept graph is a graph in which nodes are concepts and the edges indicate the relationship between the concepts. Creation of concept graphs is a hot topic in the area of knowledge discovery. Natural Language Processing (NLP) based concept graph creation is one of the efficient but costly methods in the field of information extraction. Compared to NLP based methods, Statistical methods have two advantages, namely, they are language independent and more computationally efficient. In this paper we present an efficient statistical method for creating a concept graph from a large document collection. The documents which are used in this paper are from Wiklipedia collection because of their rich and valid content. Moreover, we use the final concept graph to suggest a list of similar keywords for each unique concept or combination of concepts to find deeper information to help information extraction. Also, we will show the viability of our approach by comparing its result to a similar system called the Wordy system.
منابع مشابه
Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles
When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...
متن کاملLearning concept graphs from text with stick-breaking priors
We present a generative probabilistic model for learning general graph structures, which we term concept graphs, from text. Concept graphs provide a visual summary of the thematic content of a collection of documents—a task that is difficult to accomplish using only keyword search. The proposed model can learn different types of concept graph structures and is capable of utilizing partial prior...
متن کاملLocation Aware Keyword Query Suggestion Based on Document Proximity (Extended Abstract)
Consider a user who has issued a keyword query to a search engine. We study the effective suggestion of alternative keyword queries to the user, which are semantically relevant to the original query and they have as results documents that correspond to objects near the user’s location. For this purpose, we propose a weighted keyword-document graph which captures semantic and proximity relevance...
متن کاملKeyword Query Suggestion Based on Document Proximity
In this paper, we design a location-aware keyword query suggestion framework. We propose a weighted keyword-document graph, which captures both the semantic relevance between keyword queries and the spatial distance between the resulting documents and the user location. The graph is browsed in a randomwalk-with-restart fashion, to select the keyword queries with the highest scores as suggestion...
متن کاملEQUAL: Encyclopaedic QUestion Answering for Lists
This paper presents EQUAL , a semantic question answering (QA) system which relies on structural information from Wikipedia to extract answers to list questions in the GikiCLEF 2009 task. The system is described in the context of of a more general architecture for semantic QA. Unlike the usual textual QA approach, EQUAL does not rely on identifying the answer within a text snippet by using keyw...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008